On a hybrid MPI-Pthread approach for parallel branch-and-bound

نویسندگان

  • Juan F. R. Herrera
  • Leocadio G. Casado
  • Eligius M. T. Hendrix
چکیده

A pure MPI code is not necessarily the best solution to obtain the maximum performance [3]. When the search space is irregular, load balance problems usually appear and they may deteriorate performance as the number of MPI processes increases. For the code to scale to a larger number of cores, one solution is to combine MPI with a threaded model which has load balancing capabilities. The POSIX threads (Pthreads) model used here permits dynamic load balancing through thread generation facilitating to handle irregular data structures as those presented in B&B algorithms [4]. A mixture of MPI and Pthreads is studied. An MPI model is used for the initial work distribution and gathering the final solution. A threaded model (with intrinsic dynamic load balancing) is used for parallelism within a node. This will generally produce a code with better scaling properties than the MPI approach without dynamic workload balance. Every MPI process performs a sequential phase in which the feasible region is divided by face-to-face vertex triangulation. If the number of generated subsimplices is less than the number of processes, a sequential B&B process is initiated until the number of unexamined simplices becomes equal to the number of MPI processes. Then, each MPI process selects the corresponding simplices it is in charge of. In this schema there is not communications between MPI processes. Therefore, upper bounds are not interchanged and the load balancing is static. This model is similar to [2]. Within each MPI process, the execution starts using one thread. A thread can create a new thread if a core is idle, there is enough work to share with the new thread and the maximum number of threads is not reached. The new generated thread will receive half of the simplices stored in its parent [4]. The best upper bound is shared between the 8 16 32 64 128

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Heuristic approach to solve hybrid flow shop scheduling problem with unrelated parallel machines

In hybrid flow shop scheduling problem (HFS) with unrelated parallel machines, a set of n jobs are processed on k machines. A mixed integer linear programming (MILP) model for the HFS scheduling problems with unrelated parallel machines has been proposed to minimize the maximum completion time (makespan). Since the problem is shown to be NP-complete, it is necessary to use heuristic methods to ...

متن کامل

Parallel Homologous Search with Hirschberg Algorithm: A Hybrid MPI-Pthreads Solution

In this paper, we apply two different parallel programming model, the message passing model using Message Passing Interface (MPI) and the multithreaded model using Pthreads, to protein sequence homologous search. The protein sequence homologous search uses Hirschberg algorithm for the pair-wise sequence alignment. The performance of the homologous search using the MPI-Pthread is compared to the...

متن کامل

Flexible flowshop scheduling with equal number of unrelated parallel machines

This article addresses a multi-stage flowshop scheduling problem with equal number of unrelated parallel machines. The objective is to minimize the makespan for a given set of jobs in the system. This problem class is NP-hard in the strong sense, so a hybrid heuristic method for sequencing and then allocating operations of jobs to machines is developed. A number of test problems are randomly ge...

متن کامل

Hybrid MPI-thread parallelization of adaptive mesh operations

Many of the world’s leading supercomputer architectures are a hybrid of shared memory and network-distributed memory. Such an architecture lends itself to a hybrid MPI-thread programming model. We first present an implementation of inter-thread message passing based on the MPI and pthread libraries. In addition, we present an efficient implementation of termination detection for communication r...

متن کامل

An Improved DSM System Design and Implementation

In this paper, an Improved Distributed Shared Memory (IDSM) system, a hybrid version of shared memory and message passing version is proposed. This version effectively uses the benefits of shared memory in terms of ease of programming and message passing in terms of efficiency. Further it is designed to effectively utilize the stateof-art multicore based network of workstations and supports sta...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013